RGB-D Salient Object Detection with Ubiquitous Target Awareness
نویسندگان
چکیده
Conventional RGB-D salient object detection methods aim to leverage depth as complementary information find the regions in both modalities. However, results heavily rely on quality of captured data which sometimes are unavailable. In this work, we make first attempt solve problem with a novel depth-awareness framework. This framework only relies RGB testing phase, utilizing supervision for representation learning. To construct our well achieving accurate results, propose Ubiquitous Target Awareness (UTA) network three important challenges SOD task: 1) awareness module excavate and mine ambiguous via adaptive depth-error weights, 2) spatial-aware cross-modal interaction channel-aware cross-level interaction, exploiting low-level boundary cues amplifying high-level channels, 3) gated multi-scale predictor perceive saliency different contextual scales. Besides its high performance, proposed UTA is depth-free inference runs real-time 43 FPS. Experimental evidence demonstrates that not surpasses state-of-the-art five public benchmarks by large margin, but also verifies extensibility benchmarks.
منابع مشابه
RGB-D Salient Object Detection Based on Discriminative Cross-modal Transfer Learning
In this work, we propose to utilize Convolutional Neural Networks (CNNs) to boost the performance of depth-induced salient object detection by capturing the high-level representative features for depth modality. We formulate the depth-induced saliency detection as a CNN-based cross-modal transfer problem to bridge the gap between the " data-hungry " nature of CNNs and the unavailability of suff...
متن کاملLocal Background Enclosure for RGB-D Salient Object Detection - Supplementary Results
The purpose of this supplementary material is to examine in detail the contributions of our proposed Local Background Enclosure (LBE) feature. A comparison of LBE with the contrast based depth features used in state-of-the-art salient object detection systems is presented. The LBE feature is compared with the raw depth features ACSD [1], DC [3] and a signed version of DC denoted SDC on the RGBD...
متن کاملLearning Graph Matching for Object Detection from Rgb-d Images
We propose an optimization method for estimating parameters in graph-theoretical formulations of the matching problem for object detection. Unlike several methods which optimize parameters for graph matching in a way to promote correct correspondences and to restrict wrong ones, our approach aims at improving performance in the more general task of object detection. In our formulation, similari...
متن کاملFrustum PointNets for 3D Object Detection from RGB-D Data
While object recognition on 2D images is getting more and more mature, 3D understanding is eagerly in demand yet largely underexplored. In this paper, we study the 3D object detection problem from RGB-D data captured by depth sensors in both indoor and outdoor environments. Different from previous deep learning methods that work on 2D RGB-D images or 3D voxels, which often obscure natural 3D pa...
متن کاملSemantic Parsing for Priming Object Detection in RGB-D Scenes
The advancements in robot autonomy and capabilities for carrying out more complex tasks in unstructured indoors environments can be greatly enhanced by endowing existing environment models with semantic information. In this paper we describe an approach for semantic parsing of indoors environments into semantic categories of Ground, Structure, Furniture and Props. Instead of striving to categor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE transactions on image processing
سال: 2021
ISSN: ['1057-7149', '1941-0042']
DOI: https://doi.org/10.1109/tip.2021.3108412